Papers
arxiv:2411.07238

OpenThaiGPT 1.5: A Thai-Centric Open Source Large Language Model

Published on Nov 11, 2024
Authors:
,
,

Abstract

OpenThaiGPT 1.5, a Thai language chat model based on Qwen, achieves top performance on Thai language tasks with features like multi-turn conversations, RAG compatibility, and tool-calling.

AI-generated summary

OpenThaiGPT 1.5 is an advanced Thai language chat model based on Qwen v2.5, finetuned on over 2,000,000 Thai instruction pairs. This report provides an engineering perspective on the model's development, capabilities, and performance. We discuss the model's architecture, training process, and key features, including multi-turn conversation support, Retrieval Augmented Generation (RAG) compatibility, and tool-calling functionality. Benchmark results demonstrate OpenThaiGPT 1.5's state-of-the-art performance on various Thai language tasks, outperforming other open-source Thai language models. We also address practical considerations such as GPU memory requirements and deployment strategies.

Community

Sign up or log in to comment

Models citing this paper 7

Browse 7 models citing this paper

Datasets citing this paper 0

No dataset linking this paper

Cite arxiv.org/abs/2411.07238 in a dataset README.md to link it from this page.

Spaces citing this paper 1

Collections including this paper 1